Evolving a Locally Optimized Instance Based Learner

نویسندگان

  • Ulf Johansson
  • Rikard König
  • Lars Niklasson
چکیده

* U. Johansson and R. König are equal contributors to this paper. Abstract Standard kNN suffers from two major deficiencies, both related to the parameter k. First of all, it is well-known that the parameter value k is not only extremely important for the performance, but also very hard to estimate beforehand. In addition, the fact that k is a global constant, totally independent of the particular region in which an instance to be classified falls, makes standard kNN quite blunt. In this paper, we introduce a novel instance-based learner, specifically designed to avoid the two drawbacks mentioned above. The suggested technique, named G-kNN, optimizes the number of neighbors to consider for each specific test instance, based on its position in input space; i.e. the algorithm uses several, locally optimized k’s, instead of just one global. More specifically, G-kNN uses genetic programming to build decision trees, partitioning the input space in regions, where each leaf node (region) contains a kNN classifier with a locally optimized k. In the experimentation, using 27 datasets from the UCI repository, the basic version of GkNN is shown to significantly outperform standard kNN, with respect to accuracy. Although not evaluated in this study, it should be noted that the flexibility of genetic programming makes sophisticated extensions, like weighted voting and axes scaling, fairly straightforward.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Built-In Learner Participation Potential of Locally- and Globally-Designed ELT Materials

This study aims at empirically measuring a universal criterion for materials evaluation, i.e., learning opportunities, in a locally- and a globally-designed materials. Adopting the conceptual framework of sociocultural theory and its conceptualization of learning as participation (Donato, 2000), the researchers utilized the methodological power of conversation analysis to examine how opportunit...

متن کامل

بازیابی تعاملی تصاویر طبیعت با بهره گیری از یادگیری چند نمونه ای

Content-based image retrieval (CBIR) has received considerable research interest in the recent years. The basic problem in CBIR is the semantic gap between the high-level image semantics and the low-level image features. Region-based image retrieval and learning from user interaction through relevance feedback are two main approaches to solving this problem. Recently, the research in integra...

متن کامل

The Characterisation of Predictive Accuracy and Decision Combination

In this paper, we rst explore an intrinsic problem that exists in the theories induced by learning algorithms. Regardless of the selected algorithm, search methodology and hypothesis representation by which the theory is induced, one would expect the theory to make better predictions in some regions of the description space than others. We term the fact that an induced theory will have some reg...

متن کامل

A Feature-Weighted Instance-Based Learner for Deep Web Search Interface Identification

Determining whether a site has a search interface is a crucial priority for further research of deep web databases. This study first reviews the current approaches employed in search interface identification for deep web databases. Then, a novel identification scheme using hybrid features and a feature-weighted instance-based learner is put forward. Experiment results show that the proposed sch...

متن کامل

Robot Learners: Interactive Instance-based Learning and Its Application to Therapeutic Tasks

Programming a robot to perform tasks requires training that is beyond the skill level of most individuals. To address this issue, we focus on developing a method that identifies keywords used to convey task knowledge among people and a framework that uses these keywords as conditions for knowledge acquisition by the robot learner. The methodology includes generalizing task modeling and providin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008